Bayesian feedback versus Markovian feedback in a two-level atom 
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We compare two different approaches to the control of the dynamics of a continuously monitored 
open quantum system. The first is Markovian feedback as introduced in quantum optics by Wiseman 
and Milburn [Phys. Rev. Lett. 70, 548 (1993)]. The second is feedback based on an estimate of 
the system state, developed recently by Doherty et al. [Phys. Rev. A 62, 012105 (2000)]. Here we 
choose to call it, for brevity, Bayesian feedback. For systems with nonlinear dynamics, we expect 
these two methods of feedback control to give markedly different results. The simplest possible 
nonlinear system is a driven and damped two-level atom, so we choose this as our model system. 
The monitoring is taken to be homodyne detection of the atomic fluorescence, and the control is by 
modulating the driving. The aim of the feedback in both cases is to stabilize the internal state of the 
atom as close as possible to an arbitrarily chosen pure state, in the presence of inefficient detection 
and other forms of decoherence. Our results (obtain without recourse to stochastic simulations) prove 
that Bayesian feedback is never inferior, and is usually superior, to Markovian feedback. However it 
would be far more difficult to implement than Markovian feedback and it loses its superiority when 
obvious simplifying approximations are made. It is thus not clear which form of feedback would be 
better in the face of inevitable experimental imperfections. 
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I. INTRODUCTION 

Quantum feedback arises when the environment of an 
open quantum system is deliberately engineered so that 
information lost from the system into that environment 
comes back to affect the system again. Typically the 
environment is large and would be regarded at least in 
part as a classical system. In the case where the system 
dynamics are Markovian in the absence of feedback, the 
information lost to the environment can be treated as 
classical information: the measurement result. The feed- 
back loop thus consists of a quantum system, a classical 
detector (which turns quantum information into classi- 
cal information) and a classical actuator (which uses the 
classical information to affect the quantum system). 

In general quantum feedback is difficult to treat be- 
cause any time delay or filtering in the feedback loop 
makes the system dynamics non-Markovian. A great 
simplification arises for Markovian feedback, where the 
measurement results are used immediately to alter the 
system state, and may then be forgotten. In this case 
the dynamics including feedback may be described by a 
master equation in the Lindblad form. This was shown 
by Wiseman and Milburn for homodyne detection 
and Wiseman [|| in general. This description of feedback 
has been applied to a wide variety of systems and for a 
wide variety of purposes (see for example Refs P-[l0|). 

In a previous work |Ll| , two of us applied the Wiseman- 
Milburn feedback theory to show that almost [[t2| all pure 
states of a fluorescent two-level atom can be stabilized by 
Markovian feedback based on homodyne detection of the 
fluorescence. That is, by adding an amplitude modula- 



tion to the laser driving the atom proportional to the 
just-measured homodyne photocurrent, the atom would 
obey a master equation having any given pure state on 
the Bloch sphere as its stationary state. Without feed- 
back, the only pure stationary state is the ground state, 
in the absence of driving. That work generalized the ear- 
lier results by Hofmann, Mahler and Hess 13 lj] on the 
same problem in a number of ways. One generalization 
was to study the effect of a non-unit efficiency of the ho- 
modyne detection. This was shown to be deleterious to 
the maximum purity of the stationary states, especially 
those in the upper half of the Bloch sphere. 

For non-Markovian feedback, the master equation ap- 
proach of Wiseman and Milburn cannot be used. How- 
ever, the formalism first used to derive the Wiseman- 
Milburn master equation, quantum trajectories, can be 
used. Quantum trajectories |l5| describe the stochastic 
evolution of the state of an open quantum system condi- 
tioned upon the results of measurements performed upon 
its environment. They were first derived from abstract 
quantum measurement theory ]lr^-p^] but were indepen- 
dently invented in quantum optics for practical purposes 
pfj| , pT 15 1 . In the special case where the system has lin- 
ear dynamics, the measurement is linear (e.g. homodyne 
detection), and the feedback dynamics is linear, the quan- 
tum trajectories including feedback can be solved analyt- 
ically. In this case older techniques, based on quantum 
Langevin equations [p2j-piP] can also be used. However 
for nonlinear systems, a numerical solution of the non- 
Markovian quantum trajectories is the only recourse. 

The simplest system with nonlinear dynamics is the 
two-level atom. Non-Markovian feedback for controlling 
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this system was considered earlier by two of us |25[ . We 
considered the simplest form of non-Markovicity, a time 
delay r in the feedback loop [^6| which was otherwise 
kept exactly as for the Markovian feedback in Ref. [flT|| . 
We showed numerically that the time delay had an ef- 
fect qualitatively similar to that of inefficient detection. 
For the special case where the Markovian feedback would 
stabilize the atom in the excited state we obtained an ap- 
proximate analytical expression for the purity (as mea- 
sured by p — 2Tr[p 2 ] — 1) in the presence of a time delay. 
The result for short delays, which was found numerically 
to be valid for quite large delays, was 

p = 1 — 47-r. (1.1) 

Here 7 is the decay rate for the atom. That is, the attain- 
able purity decreases linearly with the time delay. This 
appears to be true in general for this system. 

It should not be concluded from this result that non- 
Markovian feedback is necessarily worse than Markovian 
feedback. A different paradigm for quantum feedback 
has recently been developed by Doherty et al. |^,|8| . It 
is based on an analogy with classical feedback according 
to so-called "modern control theory" |^] . Conceptually, 
the change is from basing the feedback directly on the 
measurement results, to basing the feedback on an esti- 
mate of the system state. That state estimate is of course 
based on the measurement results, but the extra step usu- 
ally makes the feedback non-Markovian from the point of 
view of the system. That is because the best state esti- 
mate will use all previous measurement results, not just 
the latest ones. 

Determining the conditioned state of the quantum sys- 
tem from classical measurement results is a quantum 
version of Bayesian reasoning. Classical Bayesian rea- 
soning updates an observer's knowledge of a system (as 
described by a probability distribution over its variables) 
based on new data p9[ . For this reason, we call feed- 
back based on a state estimate Bayesian feedback. In 
classical control theory it is common to replace Bayesian 
feedback with a simpler approximation to it. For exam- 
ple, a linearization approximation leads to the Kalman 
filter, which makes the feedback a linear functional of the 
observed current |29(| . The quantum version of this was 
explored in Refs. ]27| , |28[ , and had previously been treated 
in Ref. f(|. 

In this paper we investigate what improvement is of- 
fered by Bayesian feedback over Markovian feedback for 
the simple problem discussed above, stabilizing an arbi- 
trary state of the two-level atom. We begin in Sec. II 
by discussing the different sorts of feedback in a general 
context. Then we present the specific system of interest, 
the two-level atom, in Sec. III. This is more general than 
that considered previously JT^,[i||{L4].|5| m that we in- 
clude a term in the master equation corresponding to de- 
phasing, as caused for instance by elastic collisions with 
other (background) atoms. In Sec. IV we present and 
discuss the performance of Markovian feedback in this 



system. In Sec. V we do likewise for Bayesian feedback. 
In Sec. VI we consider the prospects for approximating 
this Bayesian feedback so that the feedback is a linear 
functional of the current. We conclude with a discussion 
in Sec. VII. 



II. QUANTUM FEEDBACK 
A. Quantum Trajectories 

Quantum trajectories are the stochastic paths followed 
by the state of an open quantum system conditioned on 
the monitoring of its environment. In this context, the 
state of the system mean the state of knowledge of the 
system that an ideal observer (unlimited by computa- 
tional power) would have given the results of the mon- 
itoring. As we cannot assume that this monitoring will 
give complete knowledge of the system, the quantum tra- 
jectory will not be a path in Hilbert space. Rather, it will 
in general be a path in the (Banach) space of state matri- 
ces p. This path is generated by stochastic and nonlinear 
equation for the conditioned state matrix, which we call a 
stochastic master equation (SME). Its classical analogue 
is the Kushner-Stratonovich equation for a probability 
distribution |||. 

The system may be coupled to many independent 
baths, but let us assume for simplicity that only one bath 
is monitored. Then we write the (deterministic) master 
equation as 

p = Cp = C p + V[c]p, (2.1) 

where the last term, described by the Lindblad |u| su- 
peroperator T)\c]p = epe' — {c ! c,p}/2, is that which is 
"unraveled" |l5| by monitoring the relevant bath. This 
monitoring yields a current I(t), and we denote the state 
conditioned on this record l[o,t) = {I( s ) ■ < s < t} up 
to time t by pi(t). The SME for this conditioned system 
state pi can then be written |32| 

dpi = Cpidt +Upidt, (2.2) 

where 

Up = {I-I)dt{M-M)p. (2.3) 

Here / represents the measurement result in the in- 
finitesimal interval [t,t + dt), which has the expected 
value E[i] = /. The notation A4, on the other hand, 
represents TrLA/To], where M. is a superoperator. The 
form of Eq. (|7|) guarantees two necessary conditions: 
Tr[Up] = 0, and E[Up] = 0. These imply that the SME 
preserves trace and, on average, reproduces the master 
equation. In addition, U must satisfy 

{tt, (D[c] +U)tt} + dt[Uir][Uir] = (V[c\ +U)n (2.4) 

for an arbitrary rank-one projector n. This implies that, 
if T>[c] were the only irreversible term, the monitoring 
would maintain the purity of the state. 
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For the case of homodyne detection we have pi 



C. Bayesian Feedback 



M.p = cp + pc* 



(2.5) 



The homodyne current / is a real- valued stochastic vari- 
able satisfying 



(Idt) 2 



dt. 



and 



I = M= Tt[p(c+c i )}. 
In other words, 

Idt = Tr[pi(c + J)]dt + dW , 



(2.6) 



(2.7) 



(2.8) 



where dW is an infinitesimal Wiener increment |3J] . 

So far we have considered efficient detection. If an ef- 
ficiency rj < 1 is included then the conditio nal e volution 
will no longer preserve purity. However Eq. (2.2) still ap- 
plies. The only difference is that in the equations for M. 
and I, c is replaced by y/rje. In particular, I(t) becomes 



Idt = 0?Tr[p!(c + J)]dt + dW . 



(2.9) 



This is simple to understand, as the Lindbladian T>[c] can 
be split into r]V[c] + (1 — T])T>[c], with only the former be- 
ing unraveled. 



B. Markovian Feedback 

Consider Markovian |}5| feedback of the homodyne 
photocurrent. Since this current is singular and of indef- 
inite sign, the only possible form of Markovian feedback 
is via a Hamiltonian 



H fb (t)=F(t) x J(t), 



(2.10) 



with F an Hcrmitian operator. 

Although Hfb at time t contains the current / at the 
same time, it must act after the measurement. Taking 
this, and the singularity of I(t) into account, yields the 
following stochastic equation for the conditioned system 
state with feedback [Q 

dpi = dt {Copi + V[c]pi + V[F]pi - i[F, Mp}} 

+ (I-I)dt(M' -M')pi. (2.11) 



Here 



M'p= Mp-i[F,p}. 



(2.12) 



As noted in the introduction, the great theoretical con- 
venience offered by Markovian feedback is that it is a sim- 
ple matter to remove the nonlinearity and stochasticity 
in this equation by taking an ensemble average. This re- 
places I{t) by I, yielding the Wiseman-Milburn feedback 
master equation 

p = C p + V[c]p - iy/rj[F, cp + pJ] + V[F]p . (2.13) 



Following the lines sketched by Doherty et al. |27| we 
now consider controlling the system dynamics using a 
Hamiltonian that depends not directly on the current, 
but rather on the observer's state of knowledge of the 
system p\. By definition there is nothing better with 
which to control the system. We thus have in general 



#fb = F(t, pi) . 



(2.14) 



It is an odd fact about Bayesian feedback that, although 
strictly it is non-Markovian, if the experimenter con- 
trolling the system has perfect knowledge of the system 
dynamics, then the system state actually does obey a 
Markovian equation, namely 



dpi = dt [C+U] pi - i[F(t,pi),pi] 



(2.15) 



However it is not possible to average over the stochastic- 
ity to obtain a master equation. This reveals the under- 
lying non-Markovicity. 

The presence of a nonlinear stochastic Markovian 
equation for the conditioned system state is an artifact 
of the assumption of perfect knowledge of the system dy- 
namics. In reality the system dynamics would not be 
known perfectly, and the experimenter's estimate pi of 
the system state p i wo uld be governed by an equation 
different from Eq. (2.2), namely 



where 



Here C, 



dpi = Cppidt +Uppidt, 



U p p = (I — Ip)dt{M - Mp)p. 



(2.16) 



(2.17) 



„p is an approximation to C. The approximation 
may be necessary due to lack of information, or it may 
be convenient to allow a simpler treatment of the sys- 
tem. This approximation may depend on the estimated 
system state pi. The stochastic unraveling superoperator 
U may also be approximated for reasons such as these, 
with M. replaced by M.. However in Eq. ( 2.17 ) we have 
shown it as approximate for a necessary reason, namely 
that in general it depends upon an estimate of p, p, in 
order to evaluate / and M. 

Linearization of dynamics is a good example of a con- 
venient approximation. It is typically applied to systems 
with infinite dimensional Hilbert spaces, corresponding 
to a classical phase space. Under linear dynamics of 
such a system, the conditioned state of the system will 
tend towards a Gaussian state. For a system with N 
co-ordinates (2A^ phase-space variable), the state pi is 
describable by 2A^ 2 + 3N variables, recording the covari- 
ance matrix and the means. This compares with of order 
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D 2N real numbers required to record pi, where D is an 
approximation to infinity. Moreover, the equation for 
the co-variance matrix is deterministic, and that for the 
means is linear. This is what leads to the Kalman filter, 
where the feedback is a linear functional of the observed 
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current 29 1. 

If the experimenter's best estimate of the system is pi 
then with the feedback included this estimate would still 
obey a Markovian equation, namely 



dpi = dt £ 



pi - i[F(t, pi),pi] 



(2.18) 



However a second, more diligent, observer would use the 
full knowledge of the system dynamics to obtain the sys- 
tem state pi. This would obey the stochastic master 
equation 



dpi = dt [C+U] pi-i[F(t,pi),pi] 



(2.19) 



Note that this is not a Markovian equation for pi, because 
the feedback depends on the estimate pi. The two equa- 
tions together are Markovian, and in control theory lan- 
guage this would be considered an example of Markovian 
control. However from the usual perspective of quantum 
mechanics, where the "system" is the quantum system, 
not the quantum system plus control loop, this is an ex- 
ample of non-Markovian feedback control. 



III. THE SYSTEM 

The simplest nonlinear system to consider is an atom, 
with two relevant levels {]<?), |e)} and lowering operator 
(j = \g) (e|. Let the decay rate be unity, and let it be 
driven by a resonant classical driving field with Rabi fre- 
quency 2a. Furthermore, let us add dephasing of the 
atomic dipole at rate T. 



A. The Master Equation 

The evolution of this system is described by the master 
equation 



p = V[a]p - ia[a y ,p] + TV[a z ] = Cp . 



(3.1) 



In this master equation we have chosen to define the 
&x = cr + <jT an d <j y — i a _ j,jt quadratures of the atomic 
dipole relative to the driving field. The effect of driving 
is to rotate the atom in Bloch space around the y-axis. 
The state of the atom in Bloch space is described by the 
three- vector (x,y,z). It is related to the state matrix p 
by 



p = i (7 + xa x + y<Ty + za z ) . 



(3.2) 



It is easy to sh ow that the stationary solution of the 
master equation (3.1) is 



BS (1 + 2r) + 8a 2 ' 

Vss = 0, 

-(i + 2r) 



(1 + 2r) + 8a 2 



(3.3) 
(3.4) 
(3.5) 



For r fixed, this is a family of solutions parameterized by 
the driving strength a e (—00,00). All members of the 
family are in the x-z plane on the Bloch sphere. Thus 
for this purpose we can reparametrize the relevant states 
using r and 9 by 



x = r sin , 
z = r cos 9, 



where 9 G [— tt,tt]. Since 

p = 2Tr[/9 2 ] - 1 = x 2 +y 2 + z 2 , 



(3.6) 
(3-7) 



(3.8) 



is a measure of the purity of the Bloch sphere, r = 
\J x 2 + z 2 , the distance from the centre of the sphere, is 
also a measure of purity. Pure states correspond to r = 1 
and maximally mixed states to r = 0. The stationary 
states we can reach by driving the atom are limited, and 
generally far from pure [ pl| . In particular, they are con- 
fined to the lower half of the Bloch sphere, as shown in 
Figs, [l] and ||. 
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FIG. 1. Locus of the ensemble average solutions to the 
Bloch equations with detector efficiency n — 0.8 and dephas- 
ing rate L = under various conditions: (a) no feedback 
(driving only); (b) Markovian feedback; (c) Bayesian feed- 
back. The dashed line is the surface of the Bloch sphere which 
is stabilizable for r\ = 1 by Bayesian feedback and (except for 
the equatorial points) by Markovian feedback. 
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FIG. 2. Locus of the ensemble average solutions to the 
Bloch equations with detector efficiency r\ = 1 and dephas- 
ing rate F = 1/20 under various conditions: (a) no feedback 
(driving only); (b) Markovian feedback; (c) Bayesian feed- 
back. Fhe dashed line is the surface of the Bloch sphere which 
is stabilizable for F = by Bayesian feedback and (except for 
the equatorial points) by Markovian feedback. 



B. Homodyne Measurement 

Now consider subjecting the atom to homodyne de- 
tection. We assume that all of the fluorescence of the 
atom is collected and turned into a beam. This could be 
achieved in principle by placing the atom at the focus of 
a parabolic mirror, but in practice it is more likely to be 
achievable in a cavity QED setting J36|, with the atom 
strongly coupled (g) to to a single cavity mode, which 
is strongly damped (k). Then the combined system acts 
like an effective two-level atom, and the output beam of 
the cavity is effectively the spontaneous emission of the 
atom, with the rate (which we have defined as unity) 
being 0(n/g 2 ). Under homodyne measurement of the x 
quadrature of the output field, the conditioned state will 
continue to be confined to the x-z plane. In this case the 
homodyne photocurrent is given by 

I(t)dt = y/jjTr[pia x ]dt + dW{t) , (3.9) 

and the measurement superoperator by 

Mp= ^q(ap + pa^). (3.10) 

The conditioning SME is thus 

dpi = Cpidt+{I -I)dt{M- M)pi. (3.11) 



IV. MARKOVIAN FEEDBACK 

Markovian feedback in this system has been considered 
before pH , except for the effect of dephasing T. This can 
be treated by the same techniques, so our presentation 
here will be brief. The aim of this feedback scheme, and 
indeed all feedback schemes considered in this paper, is to 
make the stationary state of the atom as close as possible 
to a pure state \6q), defined by 



%) = cosy|e) +siny|#). 



(4.1) 



Here 8o is a given parameter in [— it, it). The state |0o) 
is a state with r and 0, as defined above, given by r = 1 
and 8 = 8q. 

Since the desired state is in the y = plane, control 
of the atomic state can be effected by a feedback Hamil- 
tonian proportional to a y . For Markovian feedback we 
have 



Ha, = I(t)\a y /y/rj, 



(4.2) 



where A is the feedback parameter. Since the driving 
Hamiltonian is aa y , this feedback is physically realized 
simply by modulation of the driving. 

The deterministic master equation including feedback 
is, in the Lindblad form, 

A 2 

p = -i{a<T y ,p] + V\a — i\a y ]p + —V[a y ]p + TV[<j z ]. 

(4.3) 

We do not know a priori what values of A and a to choose 
to give the best results. Hence we simply solve for the 
stationary matrix in terms of a and A. Using the Bloch 
representation we find 



-4a(l + 2A)/A 



Vss = 0, 

z ss = -(1 + 2A) (1 + 4A + 2r + A\ 2 /r,) I D, 



(4.4) 
(4.5) 

(4.6) 



where 



D 



8a 2 



(l + 4A + 2r + 4A 2 /^) 
x (1 + 2A + 2A 2 /r ? ) . 



(4.7) 



The "best results" for the feedb ack s ystem is achieved 
by maximizing the radius r in Eq. ( |3.6| ) and Eq. ( |3.7| ) for 
each 8q. From these two equations we have 



tan 6 = x &s /z s . 



(4.8) 



From Eqs. (4.4) and (4.6) we can immediately find the 
desired driving in terms of A and 8q as 



a = (1/4 + A + T/2 + \ 2 /rj) tan0 O - 



(4.9) 
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The aim is then, for each 0o, to find the feedback A which 
maximizes 



(1 + 2 A) cos O 



1 + 2A + 2A 2 /ry + (r - 1 /2) sin 2 O 
We find that 

max r ss = r , 
where r is the solution of 

= rl [(1 - 77 + cos 2 6» ) /2 + T sin 2 O ] 
+ r (1 - 77) cos 9 a - t/-(cos 2 0o)/2 . 
This maximum is achieved for 
A 



-2(i 
2 v 



r 1 cos 6»o ) 



(4.10) 
(4.11) 

(4.12) 

(4.13) 
(4.14) 



Note that for 77 ^ 1, this optimal A, and the resultant 
ro, were only found numerically in previous work [ pd| . 
The analytical results here, which also include r ^ 0, are 
new. 

The curve resulting from Eq. ( 4.13| ) is shown in Fig. |l| 
and Fig. || for different parameters. For prefect condi- 
tions (77 = 1 and r = 0) it is possible to sta bilize any 
state \9q) except those on the equator (see Sec. IV A be- 
low). Under imperfect conditions, the maximum purity 
r ss decreases, with a gap opening up at the equator. For 
inefficient detection, the purity of the optimal states in 
the upper half of the Bloch sphere is affected much more 
than those in the lower half, whereas the two halves re- 
main symmetrical for non-zero phase diffusion. This is 
explicable as follows. In the limit 77 — > (no detection) 
the feedback cannot be effective, so the locus of states 
must reduce smoothly as 77 — > to the no-feedback result 
also shown in Fig. |l|. By contrast, as T increases there 
is no necessity that the no-feedback result should be re- 
covered, and moreover the phase diffusion term r2?[cr z ] is 
unchanged by reflection about the equator (a z — > —u z ). 
A number of cases of interest now need consideration. 



A. Perfect Conditions 

In the case rj = 1 , T = the above parameters simplify 
greatly. We find, in agreement with Ref. [Tl|l , 



a = (cos 0o sin0 o )/4 , 
A = -(l + cos0 o )/2. 



(4.15) 
(4.16) 



With these parameters any |0 O ) can be stabilized, except 
1 6*o I = 7r/2 (that is, on the equator of the Bloch sphere). 
This is clear from the parameters, since the values of a 
and A are the same for 9q — tt/2 and 9q = — tt/2. For a 
two-level system the same master equation cannot have 
two different stable stationary states. Thus the equato- 
rial states cannot be stabilized by Markovian feedback 
even under perfect conditions of efficient detection and 
no dephasing. 



B. Stabilizing the Excited State 



Another case where the parameters (and the purity) 
have simple expressions is for 9q = 0; that is, trying to 
stabilize near the excited stat e. Fo r this we desire .t ss = 
so a = 0. We find from Eq. ( |4.13|) that 



E[r] 



V 



2-t?' 



(4.17) 



for A = —1. Another simple case is stabilizing the ground 
state. This is of course always possible to do perfectly, 
simple by turning the feedback and driving off. 



C. Stabilizing an Equatorial State 



A final case where the purity can be found analytically 
is for 9q — ±7r/2. That is, trying to stabilize an equato- 
rial state. Markovian feedback cannot achieve this at all. 



From Eq. (4.4) and Eq. (4.6), if z ss — then necessarily 
x ss = also. The stochastic conditioned dynamics that 
underly this were explored in Ref. uM. 



V. BAYESIAN FEEDBACK 



Because Bayesian feedback is based on knowledge of 
the conditi oned state pi, we need to examine its evolu- 
tion in Eq. (|T^) in more detail. As noted above, the state 
is confined to the y — plane, so it is very convenient 
to write the evolution in terms of r and 9 as defined in 



Ens. (3.6) and (3.7). Using the Ito stochastic calculus 
|54|, the result is 

dn = { -n (l + cos 2 61 ) /2 - Tri sin 2 61 - cos 6»i 



cos^ 9i 
ri 



2 cos 6*i + n 



[sin0! (1 - rl)} \I{t) - 077-1 shift]} dt , 

(5.1) 



d0! = 



+ V 

+ Vv 



- - r ] sin 9i cos 9i + 2a+ ^— - 
2 / ri 



sin 0i (ri + cos0i) ( 1 j 

7*t 



COS V\ 

n 



[I(t) - Vrj n sin 0iU dt 



(5.2) 



For perfect state-estimation, the experimenter knows 
these values of ri and 0i from the measurement record 



I 



[0,*) 



Now we wish to add feedback, the aim of which is to 
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stabilize the state of the system to be as close as possible 
to a given pure state \9 ). We again consider feedback by 
modulation of the driving Hamiltonian, where the mod- 
ulation can depend in an arbitrary way upon p\ (that is, 
r\ and 8\). This can change 8\ but not r\. To maximize 
the closeness to the state \8q) we wish to force 9i to equal 
9o- This is achieved by the feedback Hamiltonian 



H fb = F(pi)= lim -(3a v (di - 9 ) 



This adds the term 
lim 



-2/3(0i - )dt , 



(5.3) 



(5.4) 



to dOi in Eq. ( f>.2\ ). 

Clearly with the limit f3 — > oo this term will suppress 
all fluctuations in 9i and force it to take the value 9q. The 
SME for the system then reduces to a single equation for 
7"i, found by substituting 9\ = 6>o in Eq. (bj): 



dri 

where 

A{r) = - 



A(n)dt + y/B(n)dW(t), 



(5.5) 



1 + cos z 

COS 2 9 



%) /2-Trsin 
4- 2 cos 6> + 



cos t>o, 



B(r) =7? sin 2 6 (l 



(5.6) 
(5.7) 



Here we are using dW for / — y/rj r\ sin 6\ . 

This stochastic differential equation is equivalent to 
the following Fokker-Planck equation for the probability 
P(r) — Prob[ri = r] 



P(r) = 



9 s 



i d 2 



-B{r) 



P(r). (5.8) 



It is then easy to show 

n is 



2 (dr) 2 ' 

| that the stationary mean of 



Jo rdrC{r) exp [2 /J" dr' A(r')C(r')] 
/o drC{r) exp [2 £ dr> A(r')C(r')] 



(5.9) 



where C(r) = 1/B(r). This will clearly depend upon #0. 

These integrals can be easily solved numerically, and 
the results are shown in Fig. |l| and Fig. ||. Under per- 
fect conditions, any state \0q) can be stabilized perfectly, 
as discussed below. For rj < 1 or T > the purity de- 
creases, in a qualitatively similar way to Markovian feed- 
back. However, the purity for Bayesian feedback is bet- 
ter than for Markovian feedback for almost all 9o, and is 
never worse. 



A. Near-Perfect Conditions 

It is interesting to consider the case of near-perfect con- 
ditions, where r sa ~ 1. This requires r\ ~ 1 and V <C 1. In 



this case, r\ cannot typically wander far from r ss , since it 
is bounded above by unit y. T his suggest that it may be 
possible to linearize Eq. (5J5), because the fluctuations 
are small. Assuming r < 1 and setting r\ = 1 — e with 
e < 1, we get 



A(t) ~ - cos 2 9q x (r - r ) . 



(5.10) 



Here rg is the solution of Eq. (4.13) for r, e -C 1, namely 

r ~ 1 - e(l + l/cos(9 ) -rtan 2 6» . (5.11) 

Clearly this argument only works for 9q ^ ±7r/2. 

Now it turns out that it is not valid to approximate 
B(r) by a constant B(ro) because it varies rapidly when 
r is close to one. However, this is actually irrelevant, 
because as long as A(r) can be approximated by a lin- 
ear function of r plus a constant, the equation for the 
expectation value of r\ is 



-E[r I ]=A(E[r 1 }). 



(5.12) 



In this case, it is clear that r ss = tq. That is, Bayesian 
feedback can offer no improvement over Markovian feed- 
back for the case of near-perfect conditions. This is evi- 
dent in Fig. 



SS 



1 

0.8 
0.6 
0.4 
0.2 



0.2 



0.4 



0.6 



0.8 



FIG. 3. Purity r ss achievable for Markovian (solid) and 
Bayesian (dashed) feedback as a function of r\ for 60 — n/4 
and r = 0. Note that they perform identically for 77 ~ 1. 



B. Stabilizing the Excited State 

It will have been noted from Figs. |l| and || that 
Bayesian feedback is also no more effective at stabiliz- 
ing a state near the excited state |0) than Markovian 
feedback. This ca n be proven analytically. With 9q = 
we have from Eq. (5.5) 



dr 

~dt 



1 



n (I 

2\r 



(5.13) 



which is independ ent o f I(t) (and of T) and has the sta- 
tionary solution (4.17). That is, here is another case 
where Bayesian feedback offers no improvements over 
Markovian feedback. 
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C. Stabilizing an Equatorial State 

One case where Bayesian feedback clearly has an ad- 
vantage over Markovian feedback is for stabilizing an 
equatorial state. At first sight this seems in contradiction 
with Eq. (|5.5|), which for 6q — tt/2 becomes 



rfn 



-[r + (1 - r))/2]ndt + v^C 1 - rl)dW. (5.14) 



This has an expectation value that decays to zero, the 
same as in the Markovian case. However, this equation 
also allows r\ to become negative, which invalidates the 
basis for the equation, namely that 9\ is fixed at 9o so 
that ri is positive and represents the purity. If r\ be- 
comes negative then this indicates that 9\ has switched 
from 7r/2 to —tt/2 say. Bayesian feedback could then cor- 
r ect t his. This can be treated with the above equation 
(HI if 

we assume that whenever n becomes negative 
it is made positive (with its magnitude unaltered). This 
sort of assumption has already been used in Eq. (5.9) in 
setting the lower limits of the integrals to zero. In the 
limit of large T or small 77, where rj will tend to be small, 
we can approximate the coefficient of the noise term in 



Eq. (5.14) by Jr\. Then Eq. (5.9) can be solved analyti- 
cally to yield 



77 



[r + (l-77)/2]7r' 



(5.15) 



VI. LINEARIZED BAYESIAN FEEDBACK? 

The Bayesian feedback of the previous section was un- 
rcalistically perfect in two aspects. First, we allowed for 
infinitely strong feedback. However, this is only fair for 
comparison with Markovian feedback since it also allows 
for infinitely strong feedback, since the current I(t) in the 
Markovian feedback Hamiltonian is a singular function 
of time. Second, we assumed that the state estimation 
was perfect. That is, we assumed that the experimenter 
could solve the nonlinear stochastic Bloch equations in 
real time to obtain p\ and hence 6\. This is a much more 
demanding task than for Markovian feedback, where the 
current is fed-back without any processing. It would thus 
be of interest to see how well Bayesian feedback performs 
if the processing is reduced to a level more comparable 
with that required in Markovian feedback. Specifically, 
feeding back a linear functional of the current would be 
well comparable. 

What we desire is a systematic way of deriving an ap- 
propriate linearized Bayesian feedback of this sort. An 
obvious approach is to linearize the stochastic equations 
of motion for the state vector parameters r\ and 9\. 
Assuming that this feedback does approximate the full 
Bayesian feedback, the linearization for 9\ should be done 
about #o- It then follows that the linearization for r\ 
should be done about the deterministic fixed point of 



Eq. (5JS). That is, the point r satisfying A(r ) = 0. 
But this ro turns out to be exactly the same as the r 
defined by Eq. (1.13). That is, it seems that we should 
linearize about the point that is the stationary solution 
of the Markovian feedback master equation. 

If the linearization of the dynamics were valid, then 
the variables Pi, 81 with linear dynamics would be good 
approximations to the exact variables r\. Let us con- 
sider the best case scenario where the feedback would be 
a good enough approximation to the full Bayesian feed- 
back for fluctuations in 9\ to be completely su ppre ssed. 
Then Pi would obey a linearized version of Eq. ( p.5| ), and 
the stationary mea n solu tion would give r ss . But, as al- 
ready noted in Sec. V A , if the drift term A{f\) consists 
of a constant and a linear term, then the stationary mean 
solution is the fixed point r Q . In other words, E[f] ss = r . 

This fact bodes ill for linearized Bayesian feedback. If 
the linearization were valid then rj ~ fi and so r ss ~ r . 
That is, the linearized Bayesian feedback would do no 
better than Markovian feedback. If the linearization were 
not valid, then there would be no reason to expect the 
linearized algorithm to work at all. It is quite possible 
that by fluke there is some linear functional of the cur- 
rent I(t) that would give a better result than Markovian 
feedback. However that is of no great conceptual sig- 
nificance, if the linear functional is not derived from an 
approximation to the Bayesian theory. 

The linearized Bayesian feedback described above is 
not based on a Kalman filter, because the variables r 
and 9 have no correspondence with classical phase space 
variables. In particular, r is itself a measure of purity, 
and here obeys a (linearized) stochastic equation. In the 
Kalman filter, the purity is determined by the covari- 
ance matrix, which obeys a deterministic equation. It 
might therefore be thought that a better approach to 
linearizing the Bayesian feedback would be to approxi- 
mate the surface of the Bloch sphere by a plane, thereby 
creating an analogue to the classical phase plane. Since 
the Bloch sphere dynamics are confined to the trans- 
verse plane y — 0, the tangential plane reduces to a line, 
parametrized by 9 in the neighbourhood of 9q. 

In this alternative approach, the linearization would 
then be based upon describing the state of the atom by a 
Gaussian distribution P(9) of states \9), localized about 
#o- Averaging over this distribution would give a purity 

r = E[exp (i(9 - 9 ))} ~ 1 - E[(0 - 9 ) 2 }/2. (6.1) 

It is possible to obtain an equation for P(9) by consider- 
ing fictitious noises (corresponding to hypothetical mea- 
surements of the undetected fluorescence and of the bath 
causing the dephasing), and then averaging over them. 
However to obtain a linear feedback algorithm, the re- 
sulting equation must be linearized, thus yielding a Gaus- 
sian solution for P{9) with constant variance. This would 
be possible only if the variance is much less than unity. 
That is, this approach could only work if r were close to 
one. However, we have already seen in Sec.VA that in 
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this regime, the full Bayesian feedback is no better than 
Markovian feedback, so the linearized Bayesian feedback 
cannot do better either. In fact, in this limit, it can be 
shown that direct linearization of the Bayesian feedback 
reproduces the Markovian feedback. 



VII. CONCLUSIONS 

In this paper we have contrasted two different ap- 
proaches to quantum control, Markovian feedback (where 
the current is fed-back with no filtering) and Bayesian 
feedback (where the feedback is based upon an estimate 
of the state). We have applied them to the problem of 
stabilizing the quantum state of the simplest nonlinear 
system, a two-level atom, to be near an arbitrarily cho- 
sen pure state |0o)- Due to the simplicity of our system, 
we are able to obtain all of our results without numerical 
stochastic simulations, as required in previous work on 
Bayesian feedback in nonlinear systems (28| . 

Unsurprisingly, Bayesian feedback never performs 
worse than Markovian feedback. For close to ideal con- 
ditions (small atomic dephasing, and detection efficiency 
close to one), Bayesian feedback performs identically to 
Markovian feedback, except for |0q| — tt/2. In less ideal 
situations, it performs better for almost all values of 6q. 
However, Bayesian feedback is far more demanding ex- 
perimentally than Markovian feedback. That is because 
it relics upon the real-time solution of nonlinear stochas- 
tic differential equations, namely those that determine 
the state-estimate. 

For Markovian feedback, the effect of imperfections 
(such as a time delay in the feedback loop) have been 
studies |E5[ and they are not disastrous if they are small. 
In the present study we have not considered the effect 
of imperfections in Bayesian feedback, and it is not clear 
how disastrous such inevitable imperfections would be. 
As a partial attempt to this question, we have consid- 
ered replacing the full Bayesian feedback with a linearized 
version. This would yield a feedback signal which is a 
linear functional of the feedback current, and so would 
also be experimentally more reasonable and comparable 
to Markovian feedback. Unfortunately we find that any 
systematic approach to such a linearization results in a 
feedback algorithm that would be expected to perform 
worse than Markovian feedback in general. 

Two approaches to linearization for the two-level atom 
were considered. The first is based on treating the pa- 
rameters (r, 0) of the state matrix p as the objects to be 
controlled. The second describes p as a narrow Gaussian 
mixture of state vectors about 0q. In effect, it 

seeks to control the hypothetical "true state" \0). Both 
of these approaches to quantum control were considered 
by Doherty et al. pS| ], but not as paths to a linear feed- 
back algorithm. Indeed, the first approach (which they 
actually discuss last) is described by them as "necessarily 
nonlinear" , although it can of course be linear in some 



cases. The second approach they note is potentially ill- 
defined and in any case "sub-optimal" . 

For the purposes of developing a linearized quantum 
feedback algorithm, Doherty et al. consider only one ap- 
proach to quantum control. This is the one (the first 
they discuss) based on describing the quantum system 
by a quasi-probability distribution on classical phase 
space. This description for quantum systems is natu- 
rally linearized to yield the Kalman filter, as discussed 
in Sec. II C. The negative results obtained here for 
non-Kalman linearized feedback suggests that perhaps 
good linearized quantum feedback control algorithms ex- 
ist only for quantum systems whose state can be well 
described by a classical phase space distribution. This 
of course rules out the two-level atom and other "deep 
quantum" systems. 

To conclude, if there were no restrictions placed upon 
an experimenter's processing ability or knowledge of rel- 
evant parameters then Bayesian feedback would be op- 
timal by definition. Moreover, we have shown that in 
most parameter regimes, it does strictly better than 
Markovian feedback in stabilizing the state of a two- 
level atom. However, for nonlinear systems (such as the 
atom), Bayesian feedback would be far more difficult to 
implement that Markovian feedback. Systematic linear 
approximations to Bayesian feedback fail even to match 
Markovian feedback for the two-level atom, so it is possi- 
ble that inevitable experimental imperfections would un- 
make the general superiority of Bayesian feedback in this 
system. This is related to issues of robustness in classical 
control theory , which has only begun to be explored 
in in quantum systems p8| , |38| |. Quite probably Marko- 
vian, Bayesian, and perhaps other forms of feedback will 
all have roles to play in the control of nonlinear quantum 
systems. 
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